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Abstract 

fecAgrA-oi//7£/;Alpha-hemolysin (HIa) is a major virulence factor in the pathogenesis Staphylococcus aureus infection, being 
active against a wide range of host cells. Although hia is ubiquitous in S. aureus, its genetic diversity and variation in 
expression in different genetic bacl<grounds is not known. We evaluated nucleotide sequence variation and gene 
expression profiles of hIa among representatives of hospital (HA) and community-associated (CA) S. aureus clones. 

Methods: 5} methicillin-resistant S. aureus and 22 methicillin-susceptible S. aureus were characterized by PFGE, spa typing, 
MLST and SCCmec typing. The internal regions of h\a and the hla promoter were sequenced and gene expression was 
assessed by RT-PCR. 

/fe5i//ts; Alpha-hemolysin encoding- and promoter sequences were diverse, with 12 and 23 different alleles, respectively. 
Based on phylogenetic analysis, we suggest that hla may have evolved together with the S. aureus genetic background, 
except for ST22, ST121, ST59 and ST93. Conversely, the promoter region showed lack of co-evolution with the genetic 
backgrounds. Four non-synonymous amino acid changes were identified close to important regions of hla activity. Amino 
acid changes in the RNAIII binding site were not associated to hla expression. Although expression rates of hla were in 
general strain-specific, we observed CA clones showed significantly higher hla expression (p = 0.003) when compared with 
HA clones. 

Conclusion: We propose that the hla gene has evolved together with the genetic background. Overall, CA genetic 
backgrounds showed higher levels of hla expression than HA, and a high strain-to-strain variation of gene expression was 
detected in closely related strains. 
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introduction 

Staphylococcus aureus is a human opportunistic pathogen 
responsible for a wide range of infections that can vary in its 
clinical presentation and severity. Methicillin-resistant S. aureus 
(MRS A) emerged in 1 960 in the United Kingdom and has been a 
major problem in hospitals (HA-MRSA) worldwide during the last 
40 years; however since the late 1990s, MRSA has been emerging 
as a leading cause of severe infection also in the community, in 
individuals without recent health-care contact (CA-MRSA) [1,2]. 

CA-MRSA present distinct genetic backgrounds from their 
hospital counterparts, are more susceptible to antibiotics other 



than beta-lactams, carry the smallest staphylococcal cassette 
chromosome mec types (SCCmec IV or V), and have higher 
virulence capacity [1,2,3]. The underlying reasons behind the 
enhanced virulence of CA-MRSA appear to be multiple including 
a different capacity to overcome host cell response [4], different 
distribution of mobile genetic elements carrying virulence deter- 
minants [5], allelic variation in virulence determinants located in 
the core genome and in mobile genetic elements [6] , and different 
levels of expression and protein production of virulence determi- 
nants (alpha-hemolysin, collagen adhesin, staphylokinase, coagu- 
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lase, lipase, enterotoxins C3 and Q, V8 protease and cysteine 
protease) [7,8,9]. 

The alpha-hemolysin or a-toxin (HIa), is one of the major 
virulence determinants implicated in the pathogenesis of S. 
aureus, associated to severe skin and soft tissue infections (SSTI), 
necrotizing pneumonia and even sepsis [10]. HIa is the most 
prominent S. aureus cytotoxin that can act against a wide range of 
host cells including erythrocytes, epithelial cells, endothelial cells, 
T cells, monocytes and macrophages [10,11,12]. The gene 
encoding HIa is located in the core genome and is expressed as 
a water-soluble monomer (33.2 kD) that assembles to form a 
membrane-bound heptameric ^-barrel pore (232.4 kD) on 
susceptible cells leading to cell death and lysis [11]. The overall 
structure is mushroom-like, divided into three domains: 1) Cap 
domain: largely hydrophobic, defining the entry of the pore; 2) 
Rim domain: underside of the Cap, in close proximity to 
membrane bilayer; 3) Stem domain: part of the transmembrane 
channel, forming the membrane-perforating fi-barrel pore 
(Figure 1) [10,11]. HIa expression is mainly controlled by the 
global toxin accessory gene regulator {agr), via the regulatory 
effector molecule RNAIII [13]. While agr provides the first and 
most important mechanism of up-regulation oihla, expression can 
also be modulated by other regulators, such as SaeR, SarZ, ArlS 
[14,15,16] (up-regulators) and Rot, SarT [17] (down-regulators). 

Although polymorphisms in the hla promoter region have been 
described [18], the range of genetic diversity and evolution of this 
toxin has never been assessed in a large representative S. aureus 
collection. Furthermore, although differences in hla expression 
have been described between community- and hospital-associated 
MRSA, these studies have been performed with a limited number 
of CA-MRSA epidemic clones [9], or almost exclusively with 
representatives of the USA300 clone [19,20,21]. To better 
understand the evolutionary history of hla and its importance as 
a virulence factor for CA-MRSA, in this study we compared the 
hla nucleotide sequence and expression among the major 



epidemic and minor CA and HA clones, including both MRSA 
and MSSA strains. 

Materials and Methods 

Ethics Statement 

Isolates were obtained from routine diagnostic and were 
analyzed anonymously and only the isolates, not humans, were 
studied. All data was collected according to the European 
Parliament and Council decision for the epidemiological surveil- 
lance and control of communicable disease in the European 
community. Ethical approval and informed consent were for that 
reason not required. 

Bacterial collection 

A total of 73 S. aureus, including 51 MRSA and 22 MSSA 
were analyzed in this study. Strains were collected in 13 different 
countries (Belgium, Bulgaria, Czech Republic, Denmark, Greece, 
Netherlands, Portugal, Romania, Spain, Sweden, United King- 
dom, USA and Brazil), between 1961 and 2009 from both 
community (n = 46) and hospital (n = 27). The strains comprised a 
total of 52 spa types and 23 sequence types (STs) (see Table SI). 

Strains were defined as belonging to CA or HA clones if they 
contained the same or related genetic backgrounds as the 
reference CA-MRSA and HA-MRSA epidemic control strains, 
based on ST, spa type and SCCmec (in case of MRSA). 

Media and bacterial growth conditions 

Before RT-PCR analysis, strains were grown overnight at 37°C 
on tryptic soy agar plates (TSA). Bacterial growth experiments 
were performed by growing bacteria in tryptic Soy Broth (TSB) at 
37°C with shake and measuring OD (600 nm) each hour in the 
follow up automatic incubator Bioscreen C (Oy Growth Curves 
AB, Helsinki, Finland). Plates of 100-well honeycomb (Oy Growth 
Curves AB, Helsinki, Finland) were filled with 300 |a,l/weU of 
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Figure 1. HLA protein structure. A) wildtype (highlighted the non-synonymous mutations Gln87, Glu208, Thr239 and Ser243) and B) truncated 
protein due to a stop codon at Gln87. Structure generated by the program PyMOL v.1.6. 
doi:1 0.1 371 /journal.pone.0098634.g001 



PLOS ONE I www.plosone.org 



2 



July 2014 I Volume 9 | Issue 7 | e98634 



HIa Evolution and Expression in S. aureus from Community and Hospital 



overnight culture diluted to ODgoo = 0.05 in TSB growth medium. 
Three individual growth experiments (SetC, SetD and SetE) were 
performed for each strain and named accordingly e.g. HLZ6C, 
HLZ6D and HLZ6E (see Figure S2.I to III). 

Nucleotide sequence of hia and promoter region 

Chromosomal DNA was extracted from overnight cultures, 
using the boiling method (100°C for 10 min followed by 
ccntrifugcd at 13. ()()() g for 5 minutes). Two sets of primers were 
designed to span the most polymorphic regions within the hla gene 
and hla promoter (considered as the region located — 600 bp from 
hla starting codon), after alignment of sequences available on 
NCBI for S. aureus. One set of primers (Forward: hla-F_CGAA- 
AGGTACCATTGCTGGT; Reverse: hla-R_CCAATCGATTT- 
TATATCTTTC) amplified an internal fragment of the hla gene 
(nt 1170419-1170982, CP000730.1) and the other .set (Forward: 
hlaPro-F_CACTATATTAAAAATACATAC; Reverse: hlaPro- 
R_GTTGTTACTGAGCTGAC) amplified an internal fragment 
of the hla promoter region (nt 1171289-1171773, CP000730.1) 
(Figure S3). PGR products were sequenced (Macrogen Europe, 
Amsterdam, The Netherlands) and sequences were analyzed using 
SeqMan (DNAstar, Lasergene v9, Madison, WI, USA). To each 
unique hla promoter (P) and gene sequence (hla) - allotype - a 
single Arabic number was attributed (e.g. PI, P2; hla\,hla2). Gene 
and promoter sequences were deposited in GenBank (accession 
numbers KM019547-KM0 19606; KM019607-KM019674). 

Phylogenetlc analysis 

Phylogenetic relatedness was analyzed using the MEGA.^ v.5.05 
software (http://www.megasoftware.net/) for gene, promoter 
region and concatenated sequences obtained from 1) gene with 
promoter region and 2) seven MLST alleles from the 23 
representative STs within the collection. Phylogenetic trees were 
constructed using the Neighbor-Joining clustering method, and 
1000 bootstrap replicates, which assigns confidence values for the 
groupings in the tree. 

Moreover, nucleotide diversity (ND) between the two clusters 
was calculated based on the estimation of the average evolutionary 
divergence over sequence pairs within the two groups, where the 
number of base substitutions per site from averaging over all 
sequence pairs within each group are compared using the 
maximum composite likelihood model [22]. 

Detection of recombination 

Alignments from the hla gene, hla promoter and internal 
fragments of each of the seven MLST gene were screened for the 
occurrence of putative recombination events using Recombination 
Detection Program version 4 (RDP4) (http://web.cbio.uct.ac.za/) 
with the default settings (with highest acceptable probability value 
of 0.05). Identification of recombinant sequences recombination 
breakpoints and major parent was determined using simulta- 
neously nine recombination detection methods (RDP, BOOT- 
SCAN, GENECONV, MAXCHI, CHIMAERA, SISCAN, 
PhylPro, LARD and 3SEQ. The "minor parent" is considered a 
sequence closely related to that from which sequences in the 
proposed recombinant region may have been derived (the 
presumed donor). The "major parent" was considered as a 
sequence closely related to that from which the greater part of the 
recombinant's sequence may have been derived. 

RT-PCR analysis 

Culture growth was stopped at late exponential phase, when 
alpha-toxin is described to have maximal activity [23], corre- 



sponding to the time-points 1) 3 hours 30 min in one group (65 
strains) and 2) 4 hours 30 min in another (8 strains). Total RNA 
was extracted from three biological replicates. Cells were 
mechanically disrupted with FastPrep-24 Instrument (MP Bio- 
medicals, Solon, OH, USA) and RNA was protected using RNA 
Protect (Qiagen, Valencia, USA). RNA was extracted automati- 
cally using the QIAsymphony platforms (Qiagen, Valencia, USA) 
with QIAsymphony RNA kit (Qiagen, Valencia, USA). 

The RT-PCR assay was performed on a 7500 Real-Time PGR 
System (Apphed Biosystems, Foster City, CA) using the following 
primers and TaqMan probes: Hla RT_F: TAATGAATCCTG- 
TCGCTAATGCC; HlaRT_R: CACCTGTTTTTACTGTAG- 
TATTGCTTCC; Hla RT Probe: 6FAM-AAACCGGTACTA- 
CAGATAT-MGBNFQ. The RT-PCR reaction was performed 
using the EZ RT-PCR Core Reagents (Applied Biosystems, Foster 
City, USA), in which RNA is reverse transcribed and amplified in 
a single reaction. The following PGR protocol was used: 50°C for 
2 min, 60°C for 30 min, 95°C for 5 min, followed by 42 cycles of 
95°C for 20 sec and 62°C for 1 min. The 16S gene was used as 
internal or reference control. The primers used for 168 RNA 
amplification were those previously described [24]. 

RT-PCR data analysis 

The relative hla gene expression was calculated based on the C, 
(RT-PCR output) of the gene of interest (Ct hla) as compared to 
the Ct of the internal control (C, 16S) as follows: Delta Ct = Ct hla- 
Ct 16S. The lower the Delta C, the higher is the amount of hla 
mRNA and the more the gene is expressed. The reproducibility of 
the assay was evaluated by the calculation of the arithmetic mean 
of the relative expression of the three biological replicates (Mean 
Delta Cti_3= Average (Delta Cti; Delta Ct2; Delta Cta). The 
reproducibility of RT-PCR reaction was evcduated by the 
calculation of the standard deviation (STDEV) of Delta C, 
obtained for each biological replica (Delta Cti; Delta Ct2; Delta 
C,3). Values were considered valid when at least two Ct readings 
exist witir STDEV<2. 

Protein structure visualization (pyMOL) 

The protein structure was modeled using PyMOL v. 1 .6 (http:// 
www.pymol.org/) if a nucleotide mutation gave rise to a stop 
codon. 

Statistical analysis 

The statistical analysis was performed using the Graphpad 
Prism 6 (http:/ /www.graphpad.com/scientific-software/prism/), 
with the two-tailed Student's t-test to determine whether the 
differences of mean expression rates (MSSA versus MSSA; HA 
backgrounds versus CA backgrounds) were statistically significant 
(p<0.05). 

Regression tree analysis was used to explore which variables 
could be related with the hla expression [25]. Trees explain the 
variation of a single response variable (in this study the hla mRNA 
expression) by repeatedly splitting the data into more homoge- 
neous groups, using combinations of explanatory variables (in our 
case, the ST, spa type, MRSA, MSSA and the type of SCCwc). 

Results 

Analysis of polymorphisms in the hla gene and hla 
promoter 

The sequence analysis of the internal region of hla and the hla 
promoter region among the 73 strains identified a total of 12 hla 
and 23 promoter region different sequences (allotypes) (Table 1). 
We obtained no amplification products for hla and hla promoter 
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region in one and 13 strains, respectively, which probably result 
from misparing of the primers used. 

From the 12 hla (A/a 1-1 2), we observed that only a single hla- 
allotype was found among representatives of a specific ST, except 
for ST22 (A/a 12; A/a 13) and ST30 {hlaS; hla9) where two difierent 
alleles were identified. On the other hand, the most frequent 
aUeles, hlal (33.3%, n = 24) and hla4 (20.8%, n=15), were 
identified in more than one ST. 

Regarding the nucleotide changes identified in the hla, some 
correspond to non-synonymous mutations (E208, T239 and S243) 
and, in one particular case, to a stop codon (Table 1 and 2). The 
substitutions observed did not correspond to any difference in the 
charge or polarity of the amino acid (aa). However, changes in 
molecular weight were observed: i) changes from aa D208 to aa 
E208 (D208E) and from aa S239 to T239 (S239T) gave rise to a 
higher molecular weight aa; and ii) change from aa T243 to S243 
(T243S) resulted in a lower molecular weight aa; of note all 
changes occurred in the Rim domain of the protein. In a particular 
case, the aa change gave rise to a stop codon located in the CAP 
domain, in strains of ST36. Protein structure modeling showed 
that a protein of about one third of its real size is produced, 
truncated at the Gln87 (Figure 1, A and B). The truncation is in 
the outside part of the domain, suggesting that this will affect the 
capacity of the Hla to form cell wall pores, and ultimately to 
induce hemolysis. 

A high number of sequence variations were identified in the hla 
promoter region, (n = 23) (Pl-23) (Table 1 and 2). Although we 
found that some STs were associated to a specific promoter 
allotype, and some promoters were identified in a single ST, we 
also identified cases where single STs were associated to different 
promoters (8 out of 23) and examples in which a single promoter 
allotype was associated to different STs (5 out of 23). This is the 
case of the most frequent promoter (PI) that was found in about 
one third of the strains analyzed (25.4%, n= 16), including several 
difierent STs. 

A particular highly polymorphic region corresponding to nt 
— 22 to —24 from the start codon, was found in the majority (16 
out of 23) of the promoter allotypes (exceptions PI, P6, P13, P14, 
P15, P18 and P23). These polymorphisms are located in the 
vicinity of RNAIII binding site [26] ; however, we could not find a 
direct correlation between a particular nucleotide sequence and a 
specific expression pattern (high or low expression). For example, 
the sequence TTT, observed in two strains belonging to ST398 
that have a high level expression, was also observed in strains with 
low expression belonging to other genetic backgrounds (ST8, 
ST239, ST247, ST250, ST36, ST45 and ST22). 

Alpha-hemolysin evolutionary history 

In order to better understand the evolution of A/a gene within 
the S. aureus population, we constructed phylogenetic trees from 
the hla and hla promoter sequences, separately or concatenated 
(Figure 2, A) and compared it with the tree constructed from the 
concatenated sequences of the seven housekeeping genes used in 
MLST, including all the STs represented in the strain collection 
described here (Figure 2, B). 

The phylogenetic tree constructed for the hla gene showed two 
distinct major clusters with different evolutionary clocks that 
differed in their nucleotide diversity (ND, see Materials and 
Methods): cluster (L) with lower diversity (ND = 0.005), and cluster 
H with higher diversity (ND = 0.019). Cluster L included more 
than 70% of strains (71.2%, n = 52), and five sub-clusters; Cluster 
H contained about 29% of the strains (28.8%, n = 21), and 
comprised four minor sub-clusters including hla8-hlal2 alleles. 



6 



July 2014 I Volume 9 | Issue 7 | e98634 



HIa Evolution and Expression in S. aureus from Community and Hospital 



Table 2. Strains data distribution based on promoter allotypes. 







Promotoi* 
allotype 


Gene 
allotype 


Non Synonymous 
Mutation 


Isolates Molecular 
Characterization 


Expression Category 


CA backgrounds 


ST398 


P13 


hian 




ST398-IV, tOl 1 


High expression 






P12 






ST398, t034 


High expression 






NT 






ST398-VII, tl08 


High expression 






P12 






ST398-IV, tOl 1 


High expression* 




ST25 


P6 


hial 


D208E 


ST25, 1258 


High expression 












ST25, t081 


High expression* 












ST25, 12909 


High expression 




ST9 


P22 


hial 


D208E 


ST9, tlOO 


High expression* 




ST93 


P21 


hla7 




ST93-IV, 1202 


High expression 






NT 






ST93-IV, t1819 


High expression 




ST121 


P2 


hla6 




STl 21, tl59 


High expression 






PI 






STl 21, t435 


High expression 






NT 






STl 21 -IV, t308 


High expression 






PI 






STl 21, t284 


Low expression* 




ST72 


P14 


hial 


D208E 


ST72-IV, t148 


High expression 






P14 






ST72, 13682 


High expression 






P14 






ST1810-IV, tl346 


High expression 






P14 






ST72, tl26 


High expression* 






PI 5 






ST72-IV, 1791 


Low expression 






P14 






ST72-IV, tl48 


Low expression 






P14 






ST72, t148 


Low expression 




ST80 


P16 


hial 


D208E 


ST80-lcV, tl31 


High expression 












ST80-IV, t044 


High expression 












ST80-IV, t044 


Low expression 




ST30 


P7 


hla9 


D208E; S239T 


ST30, 1342 


High expression 






NT 






ST30-IV, t019 


High expression 






P7 






ST30-IV, t019 


Low expression 






P9 






ST30, 1318 


Low expression 






NT 






ST30-IV, t019 


Low expression 






P8 


hIaS 


D208E; S239T; stop codon 


ST30, to 12 


Low expression 




ST15 


P2 


hla1 


D208E 


STl 5, t084 


High expression 












STl 5, t346 


Low expression* 












STl 5, tl509 


Low expression 




ST59 


P20 


hIaS 




ST59-IV, t216 


High expression 






PI 9 






ST59-IV, 1437 


Low expression 






P19 






ST59-V, t437 


Low expression 




STl 


P17 


hla2 




STl -IV, tl381 


Low expression 






P17 






STl -IV, tl27 


Low expression* 






P18 






STl, tl27 


Low expression 




ST8 


PI 


hla4 




ST8-IV, t008 


Low expression 












ST8-IV, t024 


Low expression 












ST8-IV, t008 


Low expression* 












ST8, t008 


Low expression 












ST612-IV, tl257 


Not valid** 












ST8, t024 


Low expression 




ST97 


P23 


hla3 




ST97, t267 


Low expression 


HA backgrounds 


ST22 


P10 


hial 3 


S239T; T243S 


ST22-IV, t032 


High expression 






PIO 


hla12 




ST22-IV, t022 


Low expression 






P11 






ST22-IV, t022 


Low expression 



PLOS ONE I www.plosone.org 



7 



July 2014 I Volume 9 | Issue 7 | e98634 



HIa Evolution and Expression in S. aureus from Community and Hospital 



Table 2. Cont. 





Promotor 
allotype 


Gene 
allotype 


Non Synonymous 
Mutation 


Isolates Molecular 
Characterization 


Expression Category 




PIO 






ST1806, t5951 


Low expression 




NT 






ST1806-IV, t747 


Low expression 


ST5 


NT 


hial 


D208E 


ST5-II, t002 


High expression 




P3 






ST5-VI, t002 


Low expression 




P3 






ST5-VI, t062 


Low expression 




P3 






ST5-VI, t311 


Low expression 




P4 






ST5-II, t002 


Low expression* 




PS 






ST5, t002, 


Not valid** 




P5 






ST5-II, t067 


Low expression 


ST36 


P7 


hIaS 


D208E; S239T; stop codon 


ST36-II, t018 


Low expression* 




P7 






ST36-II, t021 


Low expression* 




NT 






ST36-II, tOl 


Low expression* 


ST45 


NT 


hla10 


S239T 


ST45-IV, t015 


Low expression 




NT 






ST45-V, t015 


Low expression 




P7 






ST45-IV, t004 


Low expression 


ST239 


PI 


hla4 


_ 


ST239-IIIA, t037 


Low expression 










ST239-III, t037 


Low expression* 










ST239-IIIA, tl38 


Low expression 


ST247 


PI 


hla4 




ST247-I, t051 


Low expression 










ST247-IA, 051 


Low expression 










ST247-IA, t051 


Low expression 










ST247-IA, t725 


Low expression 


ST250 


PI 


hla4 




ST250-I, t008 


Low expression 




P2 






ST250-I, t008 


Low expression 



(*)(••) relative expression values not valid (SDV<2 or only one Cj reading). 
doi:1 0.1 371 /journal.pone.0098634.t002 



which were found in strains of ST30, ST36, ST45, ST398 and 
ST22. 

As opposed to the phylogenetic tree constructed from hla gene, 
the one constructed from the promoter region did not show two 
distinct evolutionary branches (Figure SI). Moreover, dissimilar 
subgroup clustering was noticed in the tree constructed from the 
promoter gene sequence. For example, ST45, ST30 and ST36 
backgrounds were clustered together in the promoter sequence- 
based tree whereas in the hla sequence-based tree ST45 was 
placed separately from ST30 and ST36 cluster (branch H). The 
same type of observations can be drawn for most of STs. Overall 
the promoter region showed to be more diverse than the hla gene 
sequence among the dilferent backgrounds. 

On the other hand, when we compared the phylogenetic tree 
constructed with the hla gene with that constructed from MLST 
concatenated genes, the same type of division into two distinct 
main clusters was observed (Figure 2). Moreover, the majority of 
STs were equally distributed between the two clusters in the two 
trees. The only exceptions were ST22, ST121, ST59 and ST93 
that in the two trees have exchanged their positions from one 
cluster to the other (Figure 2, B-blue arrows). 

Detection of recombination in hla gene, hla promoter 
and MLST genes 

To understand if recombination could explain the incongruence 
found between the trees constructed from hla and MLST 



concatenated genes, we screened the hla gene, hla promoter and 
each MLST gene for recombination events using the RDP4 
software. 

The SiScan and 3Seq methods detected one recombination 
event in the hla gene. This event corresponded to a fragment 
ending in positions 385-410 of the hla alignment, however the 
beginning breakpoint was not possible to determine. In the 
collection analyzed this event was detected in five isolates 
belonging to ST22 or related STs (HSMB280, HDE461, 
HAR22 and LBM12 (TLV ST22) and HSMB184 (TLV ST22)) 
and four isolates of ST398 (LBM54, LBM40, C496, 
C482_ST398). The ST30 HFF204 strain was identified as the 
minor parent (97.8% identity with ST22 strains and —99.3% 
identity with ST398 strains) and ST121 strain HUC587 was 
identified as the major parent (with 100% identity to ST398 strains 
and 93.5-95.2%) identity with ST22 strains) of the recombining 
fragment. A trace signal of recombination of this same event was 
also identified among ST45 isolates; however this signal was not 
statistically significant. Interestingly all the recombination events 
were detected in strains belonging to the high genetic diversity 
cluster in the tree constructed from hla gene. In the hla promoter 
region no recombination events were detected. 

We have performed the same type of analysis using the internal 
sequences of each of the seven housekeeping used in MLST 
scheme, including the alleles present in all STs identified in this 
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HLZ6STS 
LBM23ST59 
HSJ07Sn2 
USA700ST72 
HDE288ST5 
HFA28ST72 
C157ST15 
HDES26ST5 
HBR73ST5 
LBM25ST15 
C095ST25 
HSA29ST5 
C013ST5 
C003ST72 

— C230ST15 
C238ST72 
HBA33ST25 
COMSTeO 

— BK2464ST5 
C141ST25 
C168ST80 
C485STS0 

SAMS1024ST1810SLVST72 
HUC594ST72 
C270ST1 
USA400ST1 
r-|lLBM36ST1 
— HFX84ST97 
10395ST250 
HPV107ST247 
C438ST8 
HFX74ST8 
BK1953ST247 
C574ST612DLVST8 
HFX77ST239 
CCH.ST250 
HSJ419ST247 
E2125ST247 
HUC343ST239 
HU25ST239 
LBM74ST8 
LBM27ST8 
USA300ST8 
C577ST59 
C583ST59 
C434ST59 
HUC578ST121 
HUC574ST12t 
HUC587ST121 



ST5 (hlal; P3/P4/P5) 
ST25 (hlal; P6) 
ST15 (hlal; P2] 
ST9 (hlal; P22) 
ST80 (hlal; P16) 

ST72 and ST1810 (SLV ST72) (hlal; P14/P15) 



HAR38ST45 
C563ST45 

C479ST30 
HFF204ST30 
C017ST30 
HUC585ST30 
C385ST30 

DEN44I5ST36 
HFA30ST30 
HAR24ST36 
HPH2ST36 



STl (hla2; P6 PIO) 
ST97 (hla3; P23) 



ST8, ST250, 
ST247, ST239 
(hla4; P1/P2) 



ST59 (hlaS; P19/P20) 

ST121 (hla6; P1/P2) 
ST93 (hla7; P21) 



ST22, ST1806 (TLVST22) 
(hlal2,hlal3; PlO/Pll) 



ST398 

(hlall; P12/P13) 
ST45 

(hIalO; P7) 
ST30 

(hla9; P7/P9) 

ST36 
(hlaS; P7) 
ST30 
(hlaS; P8) 
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Figure 2. Phylogenetic trees of hia gene (A) and concatenated sequences of MLST alleles (B) from 23 STs representatives of the 
strains collection. The tree was constructed using MEGA 5 with Neighbour-joining method and bootstrap values provided as percents over 1000 
replications. Branch length values are indicated and the percentage of replicate trees (bootstrap test) are shown next to the branches. The dashed 
line indicates the separation of the two evolutionary branches. 
doi:l 0.1 371/journal.pone.0098634.g002 



study, however no recombination events were detected in any of 
the genes. 

Altogether the data gathered suggest that for the majority of 
strains hla gene evolved together with the genetic background. 
The different clustering of ST22 and ST121 strain.s, in the trees 
constructed from MLST concatenated genes and hla gene, may 
derive from recombination events occurring in the hla gene. 
Similarly these type of events might explain the genetic diversity 
observed in cluster H in the hla tree in strains belonging to ST22, 
ST398, ST45, ST30 and ST36 (H cluster of hla tree). 

Expression of alpha-hemolysin 

The expression of alpha-hemolysin in the 73 strains was assessed 
by RT-PCR, in three biological replicates. Fifteen of the 73 strains 
(20.5%) were excluded from the fmal analysis, either because a 
single vahd determination for Delta Cj (N — 2) was obtained or 
because Ct obtained from the different biological replicates were 
not reproducible (N= 13). 

The analysis of the regression tree split the response variable 
into two distinct groups, according to the spa type of the strains. 
There was a group of strains with mean Delta C,i..3<5.73, that 
was classified as a high expression group and a second group with 
a mean Delta Cti_3>5.73 classified as a low expression group 
(Table 1, Table 2 and Figure 3). Overall the regression tree 
explained 60% of the variance in the data. This is mostly because 
there were strains expressing a low or high mean Delta Ct that 
were classified in the same spa type; those were the cases of spa 
types t002, t019, t044 and t437. 

Furthermore, we explored in each of the spa types what other 
explanatory variables (ST, MRSA, MSSA and type of SCCmec) 
could differentiate the inclusion of some strains in the low or high 
expression group, but we found no associations with the variables 
we measured in the study. 

We observed that the hla expression level varied within strains 
of the same ST (Figure 3; Table 1 and 2). In fact, in some cases the 
same ST comprised strains with both high and low levels of 
expression (STS, ST15, ST22, ST30, ST59, ST72 and ST80). 
Moreover, we found that the expression rates did not differ 
significantiy (P = 0.665) between MRSA and MSSA strains. 
However, we did find a correlation between the hla expression 
and the origin of the genetic backgrounds. Actually, strains of GA 
genetic backgrounds showed, in general, higher mean expression 
rates than strains of HA backgrounds (p = 0.003) (Figure 4). 
Among the 21 strains (36.2%, 21 out of 58) with high expression 
level, only two {9.5%) belonged to HA backgrounds (ST22-IVh, 
t032 and ST5-II, t()02) whereas the majority (90.5 »/o, n= 19) were 
represented by CA backgrounds (Table 1 and Table 2). Moreover, 
two additional CA strains, ST72-IVa-tl48 and ST8-MSSA-t008, 
showed expression rates near the cutoff value (5.73), with 5.76 and 
5.87, respectively. These were considered as belonging to the low- 
level expression group. 

The three strains with the highest expression rate were ST398- 
VII-tl08 (2.85), ST398-IVa-t011 (3.25) and ST25-MSSA-t2909 
(4.16) and strains witii the lowest rate were ST 1806 (TLV ST22)- 
IVh-t747 (9.28), ST45-IVa-t004 (10.38) and STl-MSSA-tl27 
(11.09). 

We observed that some promoters and gene alleles (P6, PI 2/ 
P13, P21; and hla7, hla9, hlall) were exclusively associated to a 



high expression level profile, whUe otiiers (P3/P4/P5, P7, P8/P9, 
Pll, P15, P17/P18, P23; and hla4, hlaS, hlalO) were exclusively 

associated to a low expression level (Table 1 and 2). But we also 
found promoter and gene allotypes that were associated to both 
high and low expression levels. 

Discussion 

Although Hla is one of the most important S. aureus virulence 
factors [10], to the best of our knowledge, this is the first study in 
which the variation in hla nucleotide sequence and gene 
expression was assessed in such a large and representative 
collection. 

We found that the nucleotide sequence of hla was highly 
diverse. The high degree of diversity found within hla is in 
accordance to results obtained for other exotoxins, which are 
generally highly polymorphic [27]. Four non-synonymous substi- 
tutions (Q87 stop codon, D208E, S239T and T243S) were 
identified, that are located in two structural protein domains which 
are essential for Hla oligomerization and pore formation (Rim and 
Cap) [11,28,29]. The impact of these amino acid (aa) changes on 
hla activity is uncertain. If by one hand, the aa changes described 
implicate differences in the molecular weight of the aa, that can 
have influence in the three dimensional structure stabihty and 
activity of the protein; on the other hand these aa changes did not 
match any of the aa previously described to be essential for Hla 
pore formation. 

Furthermore, Walker and Bayley showed that multiple muta- 
tions in this same region (residues spanning Hla235-250) did not 
alter Hla activity in terms of binding, oligomerization or lysis. 
Thus, it would not be expected that S239T or T243S had 
significant biological impact in terms of toxin function. The unique 
mutation with an identified role in Hla function is the stop codon 
found in the ST36 and ST30 strains that was previously described 
by DeLeo and co-authors [30] to hinder toxin production and to 
originate a less virulent strain in a murine infection model. The 
true effect of the non-synonymous substitutions identified in our 
study in the activity of the protein would have to be tested by the 
construction of site directed mutagenesis mutants and by 
performing binding, oligomerization, hemolysis and in vivo 
models assays. 

The construction of phylogenetic trees from the hla defined the 
existence of two clusters with different levels of genetic diversity 
suggesting that hla is evolving at different rates in different genetic 
backgrounds. Interestingly, the most diverse cluster included the 
clonal types which are presently more disseminated or that 
emerged recentiy (like ST398). This might be related to the fact 
that these clones still need to evolve to evade the human immune 
system and not enough time as elapsed for the most adapted allele 
to have been selected [31]. On the other hand the recombination 
events detected in the hla gene in this study were all in strains 
belonging to the high genetic diversity cluster, suggesting that this 
mechanism might have been important in the most recent hla 
evolution and diversification. 

Interestingly, the phylogenetic tree constructed from the hla 
gene was similar to that constructed from MLST genes, in the 
sense that both trees distributed the different STs similarly in two 
main clusters. This observation suggests that hla gene has evolved 
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Figure 3. HA and CA strains relative expression distribution. Mean of expression rates from three biological replicates. Dashed line 
corresponding to the mean Ct value 5.73 results from the regression tree analysis which split strains in two distinct groups, at spa type level: a) high 
expression group - corresponding to strains with Mean Delta Ct<5.73 and b) low expression group- corresponding to strains with Mean Delta Ct> 
5.73). Highlighted in red are the high expressing strains. 
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together with the S. aureus genetic background. A similar type of 
correlation with tlie genetic background was previously described 
for adhesins, either located in the core genome {clfA, clfB,fnhA, 
map, sdrC, and spa) or accessory genome {ebpS, fnbB, sdrD, and 
sdrE) [32]. Although this was the case for the great majority of 



STs, we observed that four STs (ST22, STl 21, ST59, ST93) were 
located in different clusters in the hla and MLST trees. Our results 
suggest that recombination occurring at the hla level, might 
explain the different clustering of strains belonging to ST22 and 
ST121. No recombination events were, however, detected in 
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Figure 4. Distribution of the relative hla expression. Mean of relative expression of three independent readings. Expression comparison 
between a) IVIRSA and IVISSA and b) HA and CA backgrounds using the Two-tailed Student's t-test. Statistically significance (psO.05) (**). 
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MLST genes or hla sequences of strains belonging to ST59 and 
ST93, suggesting that their displacement in the two trees could 
derive from different phenomena, like random mutation. 

It was previously suggested that CA-MRSA expressed more hla 
than HA-MRSA [9]. Results from our study allowed us to extend 
this conclusion to virtually all epidemic CA, but also in two 
particular cases of HA genetic backgrounds. The CA strains 
belonging to ST398, ST25, ST121 and ST93 showed uniformly 
high relative expression rates and strains belonging to ST36, 
ST45, ST239, ST247 and ST250 showed uniformly low 
expression rates. To understand if in fact these patterns of 
expression are characteristic of these clones, more strains within 
each clone should be studied for hla expression. Nevertheless, we 
could not correlate the hla expression rate with any particular 
polymorphism within the promoter or any aa substitution in the 
hla gene. The results suggest that hla regulation is probably a 
result of combination of factors which are redundant, rather than 
associated to a single genetic event. In fact, it has been 
demonstrated by several authors that alpha-hemolysin is part of 
a complex regulatory network, that includes the main two- 
component systems (TCS) - Agr - that in turn is controlled by a 
diverse pool of regulatory networks that coordinately interact in 
response to external stimulus and cell signals, namely others TCS 
(SaeRS, ArlRS and SrrAB), alternative sigma factors (c^), and 
transcription factors (e.g. SarS, SarT, Rot, SarA, SarZ) [33,34]. 

We showed that hla evolved together with the genetic 
background. Moreover, the most epidemic CA-MRSA genetic 
backgrounds express more hla than the most epidemic HA-MRSA 
genetic backgrounds. However, the finding of frequent strain-to- 
strain variation in the expression level of hla within strains of the 
same clonal types suggests that hla polymorphisms cannot be used 
as genetic markers of virulence and investigators shotdd remain 
cautious when inferring conclusions for the entire MRSA 
population from studies performed with a limited number of 
strains. 
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